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Abstract 

Many  have  argued  that  if  labor  hicome  is  difference  stationary,  the  permanent 
income  hypothesis  predicts  consumption  should  be  relatively  volatile.  In  US 
an-gregate  data,  labor  income  is  well-characterized  as  having  a  unit  root;  how- 
ever consumption  turns  out  to  be  relatively  smooth.  This  anomaly  is  known  as 
Deaton's  paradox.  We  resolve  Deaton^s  paradox  by  providing  decompositions 
of  labor  income  into  permanent  and  transitory  components.  These  preserve  the 
univariate  dynamic  properties  of  labor  income.  However,  when  agents  distin- 
guish permanent  and  transitory  movements  in  their  labor  income — as  the  rational 
expectations  hypothesis  asserts  they  should — the  permanent  income  hypothesis 
correctly  predicts  the  observed  smoothness  in  consumption. 
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1.  Introduction 

Milton  Friedman's  pern:ianent  income  theory  of  consumption  is  one  of  the  outstanding 
successes  of  dynamic  economic  reasoning.  This  theory  asserts  that  consumption  occurs 
out  of  permanent  income,  not  current  income.  Permanent  income  is  related  to  but  dis- 
tinct from  current  observed  income.  Under  the  intuition  that  permanent  income— because 
it  is  "permanent" — should  be  smoother  than  current  income,  the  theory  has  long  been 
understood  to  predict  that  consumption  should  be  smooth  relative  to  observed  income 
fluctuations.  This  relative  smoothness  of  consumption  is  a  firmly  estabhshed  empirical 
regularity  in  aggregate  time  series  data. 

Deaton  [1987]  observed,  however,  that  the  permanent  income  hypothesis  fails  to  gen- 
erate this  smoothness  if  labor  income  is  an  integrated  process,  i.e.,  if  labor  income  has  a 
unit  root.  According  to  Deaton,  a  unit  root  characterization  for  labor  income,  given  the 
data,  implies  that  observed  consumption  is  insufficiently  sensitive  to  innovations  in  current 
income.  Deaton  concluded  that  if  labor  income  is  well-characterized  as  being  integrated, 
then  "the  representative  agent  version  of  the  permanent-income  hypothesis  can  be  rejected 
because  it  fails  to  predict  the  fact  that  consumption  is  smooth,  the  very  fact  that  it  was 
invented  to  explain  in  the  first  place"  (Deaton  [1987,  p.  122]).  Tliis  anomaly  in  the  joint 
behavior  of  consumption  and  income  has  come  to  be  known  as  Deaton 's  paradox. 

Deaton's  analysis,  therefore,  appears  to  argue  strongly  for  the  need  to  estabhsh 
whether  labor  income  truly  is  an  integrated  process.  His  work  has  suggested  that  the  pre- 
dictions of  an  important  economic  theory — Friedman's  permanent  income  hypothesis — are 
intimately  related  to  measures  of  long  run  persistence,  such  as  considered  by  Campbell 
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and  Mankiw  [1987]  and  Cochrane  [1988].^ 

This  paper  offers  a  simple  and  intuitive  explanation  for  this  smoothness  in  consunnp- 
tion.  There  are  different  kinds  of  disturbances  that  impinge  on  the  labor  income  stream. 
Some  disturbances  have  permanent  effects  on  labor  income;  other  disturbances  have  only 
transitory  impact.  I  show^  below  that  the  permanent  income  hypothesis  under  rational 
expectations — not  surprisingly — imphes  different  kinds  of  disturbances  have  different  ef- 
fects on  consumption.  Disturbances  that  do  not  have  permanent  effects  on  labor  income 
will  not  have  large  effects  on  Friedman's  permanent  income.  These  disturbances  will  there- 
fore have  only  relatively  small  impact  on  consumption.  On  the  other  hand,  disturbances 
that  do  have  a  permanent  impact  on  labor  income  will  have  relatively  large  effects  on 
consumption.^ 

Therefore,  according  to  the  analysis  here,  the  permanent  income  hypothesis  predic- 
tion for  the  smoothness  properties  of  consumption  depends  on  the  relative  importance  of 
permanent  and  transitory  components  in  labor  inconae.  The  univariate  dynamics  of  labor 
income — whether  or  not  labor  income  is  integrated,  or  how  "persistent"  labor  income  is, 
or  the  precise  form  of  the  univariate  dynamics — turn  out  to  be  not  especially  informative 
for  the  predictions  of  the  theory. 

It  remains  controversial  whether  macroeconomic  time  series  are  better  characterized 
as  being  integrated  or  as  being  stationary  about  a  deterministic  trend.  This  paper  does  not 
attempt  to  shed  Hght  on  that  issue.  Instead  it  argues  only  that  at  least  within  the  context 
of  the  permanent  income  hypothesis,  a  unit  root  characterization  for  labor  income  may 
not  have  implications  that  are  as  dramatic  as  has  previously  been  suggested.  Further,  and 
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again  at  least  within  the  context  of  the  permanent  income  hypothesis,  the  widely-noted 
measures  of  persistence  in  Campbell  and  Mankiw  [1987]  and  Cochrane  [1988]  may  simply 
be  beside  the  point. ^ 

The  remainder  of  this  paper  is  organized  as  follows.  Section  2  briefly  reviews  other  ex- 
planations of  "excess  smoothness"'  that  have  been  offered,  and  makes  expHcit  the  difference 
between  those  and  the  reasoning  in  this  paper.  Section  3  sets  out  the  standard  permanent 
income  model,  and  makes  rigorous  the  intuition  above.  Section  4  provides  decompositions 
of  labor  income  into  permanent  and  transitory  components  that  reconcile  the  following:  (1) 
the  observed  smoothness  in  aggregate  consumption,  (2)  the  estimated  univariate  dynamics 
of  labor  income  maintaining  a  unit  root  characterization,  and  (3)  the  permanent  income 
hypothesis.^  Our  explanation  for  apparent  "excess  smoothness"  in  consumption  turns  on 
the  plausible  assumption  that  economic  agents  forecast  future  labor  income  using  strictly 
more  information  than  does  the  econometrician.  In  many  rational  expectations  models, 
the  econometrician  can  take  into  account  this  superior  information  of  private  agents  by 
using  the  endogenous  variables  of  the  model  in  the  modeller's  forecasting  equations.  Sec- 
tion 5  shows  the  permanent  income  model  of  consumption  to  be  a  counter-example  to  the 
validity  of  this  methodology:  A  researcher  studying  the  observed  behavior  of  the  model 
variables  would  conclude  that  consumption  is  unresponsive  to  "news"  in  labor  income, 
even  if  the  permanent  income  hypothesis  were  to  be  true.^  Section  6  concludes  the  paper. 


2.  Related  Literature 

Beaton's  paradox  has  generated  an  extensive  literature.  For  reasons  of  space,  we  will  only 
discuss  a  small  fraction  of  the  relevant  work:  Christiano  [1987]  and  Diebold  and  Rudebusch 
[1989]  provide  more  extended  discussion  on  the  Hterature. 

Christiano  [1987]  observes  that  movements  in  labor  income  may  be  related  to  interest 
rate  fluctuations.  To  the  extent  that  savings  are  sensitive  to  interest  rate  movements,  equi- 
Hbrium  changes  in  consumption  wiU  be  dampened  by  appropriate  comovemcnts  in  income 
and  the  interest  rate.  Thus,  conditional  on  a  given  pattern  of  labor  income  dynamics, 
an  equihbrium  theory  might,  in  principle,  predict  consumption  to  be  less  volatile  than 
impUed  by  a  model  with  a  constant  interest  rate.  Christiano  therefore  studies  a  simple 
general  equihbrium  real  business  cycle  model  that  allows  the  interest  rate  to  vary  over  time. 
By  appropriately  setting  parameter  values,  he  is  able  to  match  the  observed  volatiUty  of 
changes  in  consumption.  Christiano  points  out,  however,  that  in  doing  so,  the  model  is 
unable  to  rephcate  the  actual  dynamic  behavior  of  income  in  the  US  economy. 

Caballero  [l9S8a]  modifies  the  preferences  of  the  infinitely-Hved  representative  con- 
sumer to  allow  "taste  shocks,"  and  to  incorporate  an  exphcit  "precautionary  savings" 
motive.  Cla.rida  [1988]  and  GaH  [19S9]  consider  the  aggregation  problem  in  inflnitely-hved 
model  economies  that  are  populated  by  finite-Hved  consumers.  These  modifications  parr 
tially  succeed  in  reconcihng  the  predictions  of  the  theory  with  the  data.  They  aU  suggesf 
that  even  in  the  presence  of  a  unit  root  in  labor  income,  equilibrium  theory  predicts  that 
consumption  may  be  relatively  smooth.  Note,  however,  none  of  these  proposals  quite 
confront  the  challenge  that  Beaton  posed. ^ 
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Tliis  same  comment  applies  to  that  class  of  explanations  that  suggest  labor  income 
may  in  fact  not  be  difference  stationary,  or  that  even  if  it  were  difference  stationary,  the 
usual  estimates  of  long-run  persistence  may  simply  be  "too  large."  Dicbold  and  Rudebusch 
[1989]  suggested  a  fractional  integration  model  for  labor  income;  Watson  [198G]  used  an 
unobserved  components  model.  In  US  aggregate  data,  these  alternative  parametrizations 
of  the  Wold  moving  average  representation  imply  point  estimates  for  long  run  persistence 
smaller  than  those  in  Deaton  [1987]  or  West  [1988].  Again,  these  suggestions  partially 
succeed  in  reconcihng  the  optimizing  theory  with  the  data.  As  does  Cochrane  [1988], 
these  papers  properly  warn  that  estimates  of  long  run  persistence  may  be  quite  sensitive 
to  specification.  However,  according  to  the  analysis  developed  below,  the  estimates  of  long 
run  persistence  are  simply  not  especially  relevant. 

The  conclusions  of  this  paper  rely  on  the  researcher  having  strictly  less  information 
than  agents.  In  many  rational  expectations  applications,  this  is  not  important  as  the 
model  variables  will  reveal  all  relevant  information.  This  insight  underlies  the  many  Euler 
equation-type  tests  of  market  efficiency  and  equilibrium  models.  In  the  current  setting 
however,  when  there  is  more  than  one  disturbance  affecting  labor  income,  the  permanent 
income  hypothesis  also  predicts  that  the  model  variables,  consumption  and  income,  can  not 
appropriately  reveal  the  true  effects  on  consumption  of  "news"  in  labor  income.  In  fact,  an 
example  below  shows  that  an  econometrician  studying  the  joint  dynamics  of  consumption 
and  income  will  conclude  that  consumption  seems  not  to  respond  to  certain  "news"  in  labor 
income — even  when  the  permanent  income  hypothesis  is  true.  Thus,  the  econometrician 
wiU  conclude  that  consumption  appears  to  be  "excessively  smooth,"  even  though  in  truth, 
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it  isn't.  Note  however  that  this  does  not  explain  the  rejections  of  the  i)cruianent  income 
hypothesis  in  Campbell  and  Deaton  [1989]  and  West  [1988],  a?  those  researchers  emploj'-ed 
information  on  asset  holdings,  in  addition  to  consimiption  and  income.  Following  the 
reasoning  in  Campbell  and  Deaton,  that  rejection  must  therefore  arise  from  violation  of 
the  usual  cross  equation  restrictions,  and  not  from  "excess  smoothness"  per  se. 

Flavin  [1988]  has  criticized  the  work  by  Campbell  and  Deaton  [1989]  and  West  [1988] 
from  a  different  perspective.  Flavin's  model  departs  from  the  permanent  income  theory; 
under  the  hypothesis  in  that  work,  consumption  and  savings  will  not  contain  all  relevant 
information.  By  contrast,  we  argue  here  that  the  main  force  of  Flavin's  conclusion  holds 
even  under  file  permanent  income  ijypoiijesjs. 

3.  The  Model 

Hansen  [1987]  and  Sargent  [1989]  have  provided  dynamic  general  equihbrium  interpreta- 
tions of  the  permanent  income  hypothesis  (hereafter  PIH).  The  specification  follows  that 
in  Hall  [1978]  and  Flavin  [1981],  and  comprises  the  three  equations: 

C{i)  =  rW{i),  (3.1) 


W{i)  =  K{t)  + 


j=o 


(3.2) 


K{i  +  l)  =  {l+T)K{i)  +  Y{t)-C{t).  (3.3) 

Equation  (3.1)  states  consumption  in  each  period  equals  permaueni  income — this  is  simply 
the  flow  of  rental  income  from  total  wealth  W,  accruing  at  the  time-invariant  equihbriurn 
risk-free  interest  rate  r.    Total  wealth  is  the  sum  (3.2)  of  physical  capital  K  and  human 
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wealth.  Human  wecdth,  in  turn,  is  the  expected  present  discounted  value  of  the  stream  of 
labor  income  Y.  As  usucJ  in  this  literature,  we  assume  the  labor  income  stream  is  exoge- 
nous with  respect  to  the  agent's  consumi)tion  decision.  However,  totid  income — the  sum  of 
labor  and  capital  income — obviously  depends  on  the  agent's  past  consumption  decisions. 
In  summary,  the  agent  consumes  the  resource  stream  that  flows  from  renting  out,  in  per- 
fect markets,  all  of  her  physical  and  human  capital.  Equation  (3.3)  simply  defines  capital 
stock  transition:  capital  doesn't  depreciate,  and  accumulates  through  agents'  savings. 
These  equations  can  be  combined  to  obtain: 

AC{i)  =  C{t)  -  C(<  -  1)  =  -^  •  5](1  +  r)->  {E,Y{i  +  j)  -  E,.,Y{i  +  j)) . 

Defining  /?  to  equal  (1  -|-  t-)"-^  ,  this  is: 


+  .=0 


AC{t)^{l-(3)J^(3^{E,Y{t  +  j)-E,_,Y{t+j)).  (3.4) 

i=o 

As  numerous  authors  have  emphasized,  the  right  hand  side  of  (3.4)  is  the  annuity  value  of 
revisions  in  the  expected  labor  income  stream — these  revisions  are  due  to  new  information 
arriving  in  period  t.  The  model  predicts  the  larger  is  the  impact  of  "news"  on  human 
wealth,  the  larger  should  be  the  change  in  consumption. 

Equation  (3.4)  yields  two  potentially  refutable  predictions.  First,  given  a  particular 
data  generating  process  for  labor  income,  the  magnitude  of  consumption's  response  to 
news  can  be  calculated  from  (3.4).  Second,  information  avculable  prior  to  an  arbitrary 
time  period  t  should  neither  affect  nor  help  to  predict  the  change  in  consumption  at  t,  i.e., 
consumption  should  be  a  martingale  with  respect  to  agents'  inforination — this  is  simply 
Hall's  [1978]  famous  characterization  of  consumption  under  the  PIH.  Contradiction  of  these 
imphcations  is  referred  to  as  "excess  smoothness"  and  "excess  sensitivity"  respectively. 
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In  this  model,  Hall's  martingale  characterization  is  clearly  independent  of  the  exact 
process  that  generates  labor  income.  However,  the  appropriate  statistical  theory  for  infer- 
ence should,  of  course,  depend  on  the  properties  of  the  instruments  used  to  examine  the 
martingale  restriction.  But  this  will  always  be  true  in  any  econometric  procedure,  and  is 
not  particularly  special  to  the  PIH. 

The  "smoothness"  predictions,  however,  depend  critically  on  the  model  generating 
labor  income.  That  model  is  what  defines  news  which,  in  turn,  affects  consumption  through 
(3.4).  To  see  this  explicitly,  we  briefly  summarize  Beaton's  [1987]  excess  smoothness 
argument. 

First,  suppose  labor  income  K  is  a  trend-stationary  process.  We  can,  without  loss, 
take  the  trend  to  be  identically  zero,  since  we  are  here  interested  only  in  the  second  moment 
properties  of  consumption  and  income.  If  Y  has  finite  time-invariant  second  moments,  it 
is  guaranteed  to  have  a  unique  Wold  representation: 

Y{t)  =  Y^b{kUt-k)  =  B{L)v{t), 

k=0 

where  6(0)  =  1,  the  function  B{z)  =  Y2T=o  K^)^'^  ^  0  for  all  z  on  the  closed  unit  disk,  L 
denotes  the  lag  operator,  and  t?  is  serially  uncorrelated.  Suppose  the  representative  agent 
uses  only  current  and  lagged  labor  income  observations  to  forecast  future  labor  income.  A 
result  due  to  Hansen  and  Sargent  [1980,  Appendix  A]  then  impHes  a  simple  formula  for 
human  wealth:'^ 


j=0 


LB{L)-^B{(3) 


vit)- 


9- 


Further,  by  a  property  of  iterated  expectations: 
j=o  \j=o 


LD{L)-PB{I3) 
L-13 


vit) 


LB{L)-lBiP)       , 
L^^  ^ 


L77(0, 


where  [      ]  ,   denotes  the  annihilation  operator.^  Using  these  in  (3.4),  the  resulting  change 


in  consumption  is: 


ACW  =  (l-;3)(-^^^^^-^^(^^ 


L-f3 


LD{L)-mP)r-i 


L     7/(0. 


This  simphfies  to: 


AC(/)  =  (l-;9)5(/3)77(0. 


(3.5) 


Thus,  in  any  given  period,  the  change  in  consumption  depends  on  (1)  the  interest  rate, 
through  (3,  (2)  the  dynamics  B  of  labor  income,  and  (3)  the  innovation  -q  in  labor  income. 
Given  /3  and  5,  the  change  in  consumption  is  proportional  to  news  in  labor  income.  Since  /3 
is  close  to  1  for  small  values  of  the  interest  rate,  other  things  equal,  changes  in  consumption 
should  be  relatively  "small." 

Next,  suppose  labor  income  is  difference  stationary.  As  in  our  treatment  of  the  trend 
stationary  case,  we  ignore  possible  drift  in  labor  income,  since  that  cannot  affect  the  second 
moment  properties  of  consumption  and  income.  Denote  changes  in  Y  by  l^Y .  Assuming 
the  process  /\Y  has  finite  time-invariant  second  moments,  it  necessarily  has  a  unique  Wold 
representation: 

where  a(0)  =  1,  the  function  A{z)  =  Zl^o  ^(^)^'"  r^  '-'  ^°^  l-l  —  •'■'  ^^"^  ^  ^^  serially 
uncorrelated.    If  agents  use  only  current  and  lagged  values  of  labor  income  to  forecast 
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future  labor  income — as  Deaton  [1987]  assumed — we  can  again  use  the  Hansen-Sargent 
analysis  to  obtain:^ 

oo  ( LA{L)  -  (ji^)  A{0){1  -  L)' 

As  before,  a  property  of  iterated  expectations  implies  that: 

By  the  same  reasoning  as  above,  changes  in  consumption  follow: 

AC(<)  =  (1  -  /?) .  (1  -  /?)-M(/?)6(f,)  =  A{(3)t{i),  (3.6) 


Equation  (3.6)  is  the  analogue  to  (3.5)  when  labor  income  is  difference  stationary  rather 
than  trend  stationary.  Comparing  these  two  equations,  notice  (3.6)  does  not  contain  the 
term  (1  — /3);  this  is  why,  other  things  equal,  the  PIH  under  difference  stationarity  implies 
a  relatively  more  volatile  consumption  series. 

When  A  and  Var(e)  are  estimated  on  US  aggregate  time  series  data,  the  impHed 
variance  of  the  right  hand  side  of  (3.6)  is  significantly  larger  than  the  sample  variance 
of  changes  in  consumption. ■^^  From  this  evidence,  Deaton  [1987]  concluded  that  aggregate 
consumption  is  excessively  smooth  if  labor  income  is  characterized  as  an  integrated  process. 

In  US  aggregate  data,  labor  income  appears  to  be  well-described  as  being  integrated.— 
However,  there  is  certainly  no  compelling  evidence  that  agents  in  the  economy  estimate 
humaji  wealth  using  only  their  labor  income  history.  For  instance,  suppose  there  are  two 
kinds  of  structural  disturbances  to  labor  income.  One  class  of  disturbances  has  a  permanent 
impact  on  the  level  of  labor  income;  the  other  disturbances  have  only  transitory  impact.  For 
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simplicity,  we  can  suppose  that  there  are  only  two  structural  disturbances  in  the  economy — 
one  in  each  class.  Allowing  a  more  general  specification  does^  not  alter  the  conclusions  of 
interest  here,  although  typically  the  disturbances  wiU  not  aggregate  naturally  into  the  one 
permanent  and  one  transitory  component  (see  the  Technical  Appendix  in  Blanchard  and 

Quah  [1989]). 

Under  rational  expectations,  agents  estimate  their  human  wealth  using  all  available 
information  on  the  different  kinds  of  disturbances.  In  particular,  if  it  will  improve  their 
forecasts  of  future  labor  income,  they  will  use  the  information  that  there  is  a  permanent 
and  a  transitory  component  in  labor  income.  The  innovations  in  the  different  structural 
disturbances  may  be  correlated;  however,  one  can — in  the  natural  way  and  without  loss 
of  generality — construct  an  orthogonal  decomposition  to  use  in  forecasting  future  labor 


income. ^^ 


Therefore,  suppose  we  can  write: 

Y{i)=Y,{i)  +  Yo{i), 

where  Yi  is  difference  stationary,  and  Yq  is  covariance  stationary.   Assuming  AYi  and  Yo 
have  finite  time-invariant  second  moments,  we  can  write  their  Wold  decompositions  as: 

oo 
A:=0 

and 

oo 

Yo{i)  =  J2  (^o{h)eo{t  -  k)  =  Ao{L)coit), 

where  the  innovations  ei  and  eo  3-re  uncorrelated  at  aJl  leads  and  lags.^^  For  brevity,  we 
will  refer  to  Yi   and  Yq  as  the  permanent  and  transitory  components  in  labor  income. 
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respectively.     The  permanent  component  in  labor  income  should  not  be  confnsed  with 
permanent  income,  which  is  precisely  defined  from  the  equations  of  the  PIH. 

By  exactly  the  same  reasoning  as  in  the  cases  previously  considered,  equihbrium  con- 
sumption follows: 

AC{t)  =  ^i(/?)e:(f.)  +  (1  -  /3)Ao{/3)eo{i).  (3.7) 

Equation  (3.7)  shows  that  consumption  response  depends,  in  general,  on  the  kind  of  news 
that  dominates  in  any  given  period.  For  /?  close  to  unity,  news  that  has  only  transitory 
effects  will  have  only  relatively  small  impact  on  consumption;  the  opposite  is  true  of  news 
that  turn  out  to  have  permanent  effects.  Thus,  the  theory  predicts  that  consuinptlon 
volatihty  depends  on  the  relative  importance  of  permanent  and  transitory  components  in 
labor  income. 

To  see  expUcitly  the  impHcations  for  consumption  volatihty,  we  need  the  Wold  lag 
distributions  Ai  and  Ao,  as  well  as  the  innovation  variances  Var(ei)  and  Var(€o)-  While 
the  components  Yi  and  Yq  are  guaranteed  to  sum  to  the  observed  labor  income  process  Y, 
their  innovations  €i  and  cq  bear  no  simple  relation  to  the  innovation  e  in  Y.  Nor  is  there 
a  simple  relation  between  the  lag  distributions  Ai,  Aq  and  A:  in  particular,  it  is  not  true 
that  Ai{z)  +  (1  -  z)Ao{z)  equals  A{z). 

How  then  is  such  a  decomposition  into  permanent  and  transitory  components  consis- 
tent with  the  time  series  observations  on  aggregate  labor  income?  It  is  clearly  necessary 
that  the  spectral  densities  of  AYi  and  AYo  sum  pointwise  to  equal  the  spectral  density  of 
AY.  Thus,  for  all  lj  in  (— 7r,7r],  we  have: 

Var(6)lA(e-''^)|2  =  Var(ei)|^i(e-*")i^  +  Var(eo)il  -  e-'-f  l^o(e-'-^)|^ 
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Under  weak  regularity  conditions,  this  relation  across  the  spectral  densities  is  not  only  nec- 
essary but  also  sufficient  to  characterize  the  orthogonal  decomposition  (see  Quah  [1989]). 
In  the  subsequent  discussion,  we  can  therefore  focus  only  on  this  pointwise  equahty  in  the 
spectral  densities.  Making  the  natural  definitions,  we  can  write  this  as: 

S{lj)  =  5i(w)  +  11  -  e-''"|=5o(a;).  (3.S) 

This  equation  has  two  important  features  that  we  will  use  repeatedly  below.  First,  since 
the  second  term  on  the  right  hand  side  is  nonnegative,  Si  must  be  everywhere  bounded 
from  above  by  S.  Next,  Si  must  equal  S*  at  u;  =  0,  since  the  second  term  on  the  right 
hand  side  vanishes  there.  Thus,  Si{u))  <  S{uj)  for  all  w,  with  strict  equality  at  w  =  0.  {n 
words,  the  spectral  density  of  changes  in  observed  labor  income  forms  an  outer  envelope 
for  that  in  its  permanent  component,  where  that  outer  envelope  is  binding  at  frequency 
zero.  Put  another  way,  the  spectral  densities  of  changes  in  the  permanent  and  transitory 
components  is  a  cleaving  of  that  in  observed  labor  income.^'*  Figure  1  illustrates  such  a 
cleaving  of  a  spectral  density. 

By  the  equality  at  frequency  zero  of  S  and  Si,  agents'  forecasts  of  the  long  run  effects 
of  a  disturbance  are  always  the  sanie — regardless  of  whether  agents  view  the  disturbance  as 
one  in  the  permanent  component,  or  as  one  in  observed  labor  income  itsclf.^^  The  relative 
importance  of  the  permanent  and  transitory  components  is  altered  as  we  vary  the  cleaving 
of  the  spectral  density.  Across  all  cleavings,  however,  the  measure  of  long  run  persistence 
always  remains  the  same.  Equation  (3.7)  suggests  then  that  as  long  as  3  is  strictly  less  than 
one,  the  volatihty  of  consumption — which  appears  to  vary  with  the  relative  importance  of 
the  permanent  and  transitory  components — is  not  determined  in  any  essential  way  by  the 
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magnitude  of  long  run  persistence.    In  the  next  section,  we  will  use  the  spectral  density 
characterization  to  show  this  rigorously. 

4.  Explaining  "Excess  Smoothness" 

A  first-order  autoregressive  model  for  the  first  differences  of  US  aggregate  labor  income 
yields  point  estimates  of  0.44  for  the  autoregressive  coefficient  and  63G.1  for  the  innovation 
variance.  If  the  risk-free  interest  rate  r  is  taken  to  be  1  percent  per  quarter,  then  equation 
(3.6)  impHes  the  variance  of  consumption  changes  should  be  1997;  the  actual  observed 
sample  variance  is  only  246.^^  Tables  1  and  2  in  West  [1988]  display  different  ARIMA 
parametrizations  for  labor  income  that  all  show  the  same  conclusion.  Consumption  appears 
to  be  "excessively  smooth,"  given  the  maintained  assumption  that  labor  income  is  an 
integrated  process. 

Now  suppose  agents  use  information  on  permanent  and  transitory  movements  in  labor 
income  to  estimate  their  hum.an  wealth.  Recall  consumption  should  then  behave  as: 

AC{t)  =  A,{p)e,{t)  -f  (1  -  /3)Ao{fi)eoit), 

which  imphes  the  variance  of  consumption  changes  is: 

Var(AC)  =  A,{P)'  •  Var(6a)  +  (1  -  P)'Ao{P)'  ■  Var(eo). 

Clearly,  the  univariate  Wold  characterization  of  labor  income  does  not  directly  restrict  the 
smoothness  properties  of  consumption. 

A  simple  example  will  build  intuition  for  the  calculations  to  follow,  although  the  exam- 
ple isn't  completely  successful  in  explaining  "excess  smoothness.''  Suppose  the  permanent 
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component  Yi  in  labor  income  is  described  by  Ai{z)  =  (1  —  7^)"^  with  I7I  <  1,  i.e.,  the 
permanent  component  is  a  stationary  first  order  autoregressive  process  in  first  differences. 
Assume  the  correct  model  for  observed  labor  income  is  the  first  order  autoregression  in 
first  diff"erences  above.  For  the  "outer  envelope"  condition  described  above  to  hold,  we 
must  have  (1)  7  >  0.44,  and  (2)  Var(ei)  =  (1  -  7)^  x  (1  -  0.44)-=  x  G36.1.  Condition  (1) 
guarantees  the  spectral  density  5  of  Ay  dominates  that  of  AYi;  condition  (2)  restricts 
these  spectral  densities  to  be  equal  at  frequency  zero.  It  follows,  then,  that  AY  —  AYi  is 
the  first  diff'erence  of  a  process  that  is  covariance  stationary.^^ 

The  pointwise  equahty  of  the  spectral  density  sum  (3.8)  then  allows  derivation  of  the 
dynamics  in  Yo-  For  all  z,  we  have: 

Ya.i{eo)-{l-z){l-z-')Ao{z)Aoiz-')  '      ' 

=  Var(6)  •  A{z)A{z-')-  Var(ei)  •  A,{z)A,{z-') 


Var(6) 


X 


l-OAAzJ  \l  -0.44Z-1 


1-7 
1  -  0.44 


1  —  7;:/    \  1  —  7z~^ 


=  Var(e) 


(l_7„-)(l_7,-i)_ 


1-7 
1-0.44 


(1  -0.44z)(l  -0.44z-^) 


(1  -  0.44r)(l  -  7z)(l  -  0.44z-i)(l  -  -yz-^) 


Since  the  numerator  of  the  right  hand  side  expression  vanishes  at  z  =  1,  we  can  divide 
both  sides  by  (1  —  z)(l  —  z~^)  to  obtain: 
Var(eo)  •  Ao(--).4o(z-^)  =  Var(e)x 


(l_7,)(l_7,-i) 


1 — r 
1-0.44 


(1  -  0.44r)(l  -  0.44Z-1)     /(I  -  z)(l  -  z'^) 


(1  -0.44z)(l  -7z)(l  -0.44z-i)(l  -jz'^) 
1  2 


Var(e) 


7- 


l--r 
1-0.44 


X  0.44 


(1  -  0.44z)(l  -  7z)(l  -  0.44z-i)(l  -  7Z-1) 
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But  this  last  expression  is  simply  the  covariance  function  of  a  second  order  autoregressive 


process: 


Yo{i)  =  (0.44  +  'y)Yo{t  -  1)  -  (0.44  •  ^)Yo{t  -  2)  +  eo{t). 


where  Var(eo)  =  Var(e)  •     7  - 


1-7 
1-0.44 


X  0.44  j  is  positive  since  0.44  <  7  <  1. 
Recall  the  contribution  to  Var(AC)  is  Ai{/3)^  •  Var(ei)  for  disturbances  ei  that  have 
permanent  impact  on  labor  income,  and  (1  —  (3)'^Ao{P)'^  •  Var(eo)  for  disturbances  eo  with 
only  transitory  effects.  For  the  example  here,  these  are: 

-2 


(1  -  7/5)"'(l  -  7)^(1  -  0.44)-^  X  636.1, 


and: 


{l-/3)\l-0A4P)-\l-^py 


7- 


1-7 


0.44 


X  G36.1. 


1  -0.44 

respectively.  The  predicted  variance  of  AC  is  simply  the  sum  of  these.  For  7  =  0.5,  the 
implied  value  of  Var(AC)  is  1989;  7  =  0.75,Var(AC)  =  1916;  7  =  0.8,Var(AC)  =  1882; 
7  =  0.9,Var(AC)  =  1727;  7  =  0.95,Var(AC)  =  1493;  7  =  0.99,Var(AC)  =  1017;  7  = 
0.995,  Var(  AC)  =  1083. 

Allowing  agents  to  distinguish  permanent  and  transitory  components  in  labor  income, 
therefore,  can  potentially  smooth  the  consumption  implied  by  the  PIH.  Even  when  Yi  is 
restricted  to  be  a  first  order  autoregression  in  first  differences,  consumption  volatility  can 
fall  by  as  much  as  one  half  over  that  when  agents  forecast  labor  income  using  only  its  past 
history.  The  intuition  of  the  previous  section  is  therefore  correct:  altering  the  cleaving  of 
a  fixed  outer  envelope  spectral  density  S  affects  the  PIH  prediction  for  the  volatility  of 
consumption.  This  consumption  smoothing  occurs  without  having  to  change  any  of  the 
univariate  properties  of  the  labor  income  process. 


-17- 
To  complete  the  argument,  we  need  to  show  a  cleaving  exists  that  reconciles  the 
actual  volatihty  in  consumption  with  the  predicted  volatihty,  taking  as  given  the  univariate 
dynamics  in  labor  income.  This  existence  question  can  be  formulated  in  terms  of  an  infinite- 
dimensional  optimization  problem.  Take  as  given  (1)  a  Wold  decomposition  {A,e)  for  Ay, 
and  (2)  a  real  interest  rate  r  implying  a  value  for  j3.  What  is  the  minimum  value  of 
Ai{/3fVa.T{ei)  +  (1  -  /5)^^o(/5)^Var(eo)  such  that  (3.8)  is  satisfied?  Formally,  we  have  to 
solve: 

inf  yl:(/3)=Var(e:)  +  (1  - /3)==.4o(^)=Var(eo) 

Ai,Ao,Vi>-T{(i),\'n.T(€o) 

subject  to: 

a.  Var(e)|A(e-'''")|=  =  Var(ei)|.4a(e-'")|'  +  Var(eo)ll  -  e-'"|=|^o(e-'")|'      for  all  a;, 

b.  (u4i,ei)  and  {Ao,€o)  are  Wold  representations. 

The  natural  parameter  space  is  infinite-dimensional  and  equals  C'  x  C"  x  7^:^..    Given  (1) 

and  (2),  consumption  displays  excess  smoothness  if  this  program  has  VcJue  exceeding  the 
sample  estimates  of  Var(AC). 

Instead  of  solving  this  problem  directl}--,  it  is  sufficient  to  display  an  example  satisfying 
a.  and  b.  that  achieves  a  value  equal  to  the  sample  estimate  of  the  variance  of  consumption 
changes.  As  before,  a  choice  for  Ai  immediately  determines  aU  the  remaining  parameters. 
Equahty  of  Sj  and  S  at  frequency  zero  fixes  the  innovation  va.riance  Var(ei)  in  the  perma- 
nent component.  The  pointwise  equahty  for  all  u,  given  as  |l  — e~"^l*5o(w)  =  S{u)  —  Si{uj) 
determines  the  function  Sq.  This  spectral  density  Sq  can  then  be  factored  to  obtain 
uniquely  Var(eo).  and  Aq  in  £'.  where  An(0)  =  1.  and  Ac{z)  ^  0  for  all  \z\  <  1.  This 
calculation  is  standard  since  the  resulting  spectral  density  is,  by  construction,  a  rational 
function:  see,  for  example,  Rozanov  [19G7]  Chapter  1,  Section  10,  pp.  43-50. 
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More  explicitly,   fix  a  candidate  Wold  representation  {A,e)   for  AY.     Consider  lag 
distributions  Ai  of  the  form: 

A,{z)  =  {l+z)yAu{z), 

where  Aid  is  some  fixed  polynomial,  such  that  ^i(/(0)  =  1,  and  ^id(z)  ^  0  for  \z\  < 
1.^^  This  restricts  the  permanent  component  Yi  to  be  an  ARIMA  process,  with  the 
moving  average  part  having  binomial  coefficients.  As  q  increases,  the  spectral  density 
Var(ei)|Ai(e~"'')p  is  guaranteed  eventually  to  be  bounded  from  above  by  any  fixed  spec- 
tral density  that  shares  the  same  value  at  frequency  zero.-'^  By  5i(0)  =  5(0),  it  follows 
that: 

Var(ei)  =  4-^1^(1)2^(1)2  x  Var(e). 

In  the  second  step,  the  dynamics  of  AYq  satisfies: 
Var(eo)  •  (1  -  z)(l  -  z-^)Ao{z)Ao{z-^)  =  Var(e)x 

The  right  hand  side  above  vanishes  at  z  =  1,  by  our  choice  of  Var(ei).  We  can  therefore 

write: 

(Aiz)Aiz-^ )  -  4-Mi,(l)M(l)2  .  ^^i^tS^rff^) 
Var(6o)^o(z)Ao(z-^)  =  Var(6)  x  ^ ^i  -  ,)^i  -  ,-i)  ' 

For  sufficiently  large  g,  the  right  hand  side  is  the  covariogram  of  a  real  covariance  stationary 
process.    We  can,  therefore,  factor  it  to  obtain  Var(ei)  and  Aq  such  that  (1)  ^o(O)  =  1? 

(2)  the  power  series  expansion  of  Ac{z)  is  one-sided  in  non-negative  powers  of  z,  and  fijially, 

(3)  ^0(2)  ^  0  for  all  \z\  <  1.-°  Finally,  taking  a  value  for  /?,  the  resulting  lag  distributions 
and  innovation  variances  can  be  used  to  find  the  implied  variance  of  consumption  changes. 
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We  fix  (3  to  the  value  implied  by  a  risk-free  interest  rate  r  of  1  percent  per  quarter.'^ 
Tables  1-9  display  the  results  of  the  above  procedure  for  each  of  the  candidate  Wold  rep- 
resentations for  Ay  in  West  [1988].  West  estimated  a  variety  of  models  to  check  the 
robustness  of  his  findings.  Although  our  results  lead  to  the  opposite  conclusion  from  his, 
here,  as  in  his  work,  the  findings  do  turn  out  to  be  insensitive  to  the  exact  parametric 
specification.  The  first  panel  in  each  table  gives  point  estimates  for  alternative  AKMA 
parametrizations  of  the  Wold  representation  of  Ay.  In  our  notation,  the  ARMA  parame- 
ters (j)  and  9  satisfy  A{z)  =  (1  —  <f>iz  —  <}>2Z')~^{\  -f  B-yz  -\-  ^2-')-  The  innovation  variance 
associated  with  the  given  AliMA  parametrization  is  next  presented.  These  always  exceed 
GOO;  the  largest  vaJue  is  naturally  that  in  Table  1 — the  random  walk  case.  Next,  LK  denotes 
the  imphed  measure  of  long  run  persistence,  as,  for  example,  in  Campbell  and  Mankiw 
[1987].  Following  that,  t/iq  is  the  square  root  of  the  ratio  of  PIH-predicted  Var(AC)  to 
Var(e);  filially,  ip  is  the  actual  square  root  of  the  ratio  of  Var(AC)  to  Var(e)  found  in  the 
data.  The  discrepancy  between  •0o  and  ^  is  one  representation  of  the  Deaton  paradox. 

The  second  panel  in  each  table  shows  the  impHed  ■I'B.riance  of  consumption  changes  due 
to  the  hypothesized  permanent  and  transitory  components  in  labor  income.  For  alterriative 
settings  of  q — the  moving  average  length  in  Ai'i — we  show  first  the  individual  variance 
contributions  of  the  different  kinds  of  disturbances,  and  then  the  sum  of  these  contributions. 
Notice  that  the  contribution  of  e\  is  always  much  larger  than  that  of  cq-  This  is  consistent 
with  the  message  in  Lucas  [1987,  Chapter  3]  that  cycUcal  fluctuations,  by  comparison  with 
secular  movements,  are  simply  not  significant  for  many  economic  questions. 

The  last  row  in  this  panel  shows  t/'i  the  square  root  of  the  ratio  of  the  implied  Var(AC) 


/' 
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to  Var(e).  As  q  increases,  the  value  of  i/'i  falls  nionotonically.  In  the  last  column  of  this 
panel,  we  show  the  value  of  q  that  imphes  t/'j  =  V'-  Finally,  the  last  section  of  each  table 
presents  the  value  for  Var(eo)  associated  with  that  q  that  matches  V'l  to  V'-"' 

The  last  column  in  the  second  panel  of  each  table  therefore  answers  positively  the 
existence  question  above.  For  all  9  ARM  A  models  hypothesized  for  the  Wold  representation 
of  Ay,  there  exists  a  permanent-transitory  decomposition  that  exactly  matches  the  PIH 
predicted  consumption  volatiHty  with  the  data.  For  all  9  models  considered,  the  long-run 
measure  of  persistence  LH  is  substantial.  Despite  this,  for  all  9  models,  the  PIH — properly 
considered — does  not  predict  consumption  volatihty  exceeding  that  in  the  data.  Along 
this  dimension,  therefore,  the  PIH  is  not  inconsistent  with  the  data.  In  summary,  the  PIH 
predictions  do  not  particularly  depend  on  (1)  whether  labor  income  is  best  characterized 
as  being  integrated,  nor  (2)  the  magnitude  of  labor  income's  long  run  persistence,  nor  (3) 
the  exact  form  of  labor  income's  univariate  dynamics.  By  the  nature  of  the  argument,  it 
should  be  clear  that  these  conclusions  would  hold  regardless  of  the  exact  parametrization  of 
the  Wold  representation  for  AY",  even  beyond  the  ARMA  (2,  2)  cases  expHcitly  considered 
here. 

Are  the  permanent-transitory  decompositions  that  reconcile  the  PIH  with  the  data 
reasonable?  It  is  difficult  to  interpret  directly  the  permanent  and  transitory  components 
used  here  for  forecasting,  since  they  are  not  necessarily  structural  economic  disturbances. 
Thus,  one  should  not  read  the  q  values  in  the  last  column  of  the  tables  to  say  that  economic 
agents  perceive  structural  shocks  with  permanent  effects  as  very  long  ARIAJA  processes. 
The  true  structural  shocks  agents  see  are  Hkely  to  be  imperfectly  correlated  across  distur- 
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bances  with  permanent  and  transitory  effects. 

It  might  be  interesting  to  explicitly  identify  the  structiy-al  disturbances  that  agents 
see  driving  labor  income.  However,  such  an  exercise  is  not  at  ;U1  relevant  in  the  current 
context.  Instead,  here  we  might  simply  compare  our  transitory  component  in  labor  income 
with  stationary  components  that  others  have  estimated.  Figure  2  plots  the  response  in 
labor  income  to  a  unit  disturbance  in  the  transitory  component.  The  response  for  each  of 
the  9  models  considered  in  the  tables  is  graphed.  In  every  case,  the  effects  have  a  hump 
shape  and  decay  rapidly:  no  more  than  half  the  original  impact  of  the  disturbance  remains 
after  four  years.  This  is  quite  consistent  with  the  moving  average  representation  that,  for 
instance,  Blan chard  and  Quah  [19S9]  call  the  dynamic  effects  of  "aggregate  demand"  in 
their  study  of  GNP. 

5.  Effects  of  Agents'  Superior  Information 

Campbell  and  Deaton  [1989]  and  West  [1988]  have  also  emphasized  that  economic  agents 
are  hkely  to  have  more  information  than  the  econometrician.  Equihbrium  consumptioii  is 
then  necessarily  smoother  than  when  agents  use  only  the  past  history  of  labor  income  to 
forecast  future  labor  income.  The  question  becomes,  could  this  superior  information  effect 
suffice  to  account  for  the  observed  smoothness  in  the  data?  CampbcU  and  Deaton  [1989] 
and  West  [1988]  find  the  answer  to  be  no. 

Suppose  a  researcher  attempts  to  account  for  tliis  superior  information  by  studying  the 
liistory  of  time  series  observations  on  consumption  and  income.*^  Recall  that  consumption 
is  a  martingale  under  the  rational  expectations  version  of  the  PIH.  Thus  it  is  natural  to 
suspect  that  the  history  of  consumption  should  contain  all  the  relevant  information  that 
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agents  use  to  forecast  the  future.  In  other  words,  even  though  economic  agents  are  hkcly 
to  have  more  information  than  the  researcher,  studying  the  joint  consumption-income 
process  should  allow  discovery  of  the  correct  relation  between  "news"  and  the  reaction  in 
consumption,  even  though  the  researcher  never  directly  observes  "news."  This  argument 
appears  to  be  related  to  a  result  in  Hansen  and  Sargent  [1981].  Hansen  and  Sargent's 
[1981]  Theorem  shows  that,  under  certain  conditions,  the  hallmark  rational  expectations 
cross-equation  restrictions  hold,  even  when  the  researcher  uses  an  information  set  strictly 
smaller  than  that  of  economic  agents. 

The  explanation  given  in  this  paper  of  course  says  that  agents  have  more  information 
than  the  econometrician.  Why,  then,  doesn't  consumption  appropriately  reveal  the  news 
that  agents  see?  The  reason  for  this  is  interesting  in  its  own  right:  the  PIH  turns  out  to 
imply  that  agents  observe  innovations  that  are  not  fundamental  for  the  joint  consumption- 
income  process.  We  now  show  this  expHcitly.^'* 

Under  the  PIH,  and  our  assumptions  on  permanent  and  transitory  fluctuations,  the 

joint  process  for  the  changes  in  labor  income  and  consumption  is: 

fAY{i)\_fML)     {1  -  L)Ao{L)\  f  e,{t)\ 
{AC{t)J-[M/3)     (1  -  P)Ao{p)  J  {eoit)  J  •  ^^"^^ 

The  determinant  of  this  matrix  moving  average  is  the  function: 

6{z)  =  (1  -P)Ao{/3)A^{z)  -  Ar{p){l  -  z)Aoiz), 

Two  features  of  this  function  should  be  noted  here:  (1)  The  determinant  8  is  different  from 
zero  at  z  =  1.  The  spectral  density  of  the  jointly  covaxiance  stationary  vector  {AY',  AC) 
is  therefore  fuU  rank  at  frequency  zero.  In  words,  labor  income  and  consumption  are  not 
cointegrated.  (Campbell  [1987]  has  made  the  same  observation.) 
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The  intuition  is  straightforward:  the  martingale  consumption  impHcation  of  the  PIH 
means  that  any  news  will  have  permanent  impact  on  the  level  of  consumption.  In  partic- 
ular, even  news  that  has  only  transitory  effects  on  labor  income  has  permanent  effects  on 
consumption.  (2)  The  determinant  6  vanishes  at  z  =  /?,  which  is  strictly  inside  the  unit 
circle.  But  then,  ej  and  eo,  as  well  as  all  hnear  combinations  of  them,  cannot  be  recovered 
from  observations  on  current  and  lagged  values  of  the  exogenous  and  endogenous  variables 
Ay  and  AC  (see  for  example  Rozanov  [19G7],  Remark  3,  p.  C3.). 

Note  this  does  not  mean  agents  are  somehow  forecasting  using  future  values  of  labor 
income.  Recall  agents  use  only  current  and  lagged  values  of  Yi  and  Yq  to  predict  future 
labor  income.  The  nonfundamentcJness  means  simply  that  the  observed  variables  cont;un 
strictly  less  information  than  that  in  the  e's.  We  can  see  exphcitly  the  implications  for 
inference  by  considering  a  simple  example. 

Consider  the  Friedman-Muth  model  (Muth  [i960]):  Suppose  the  permanent  compo- 
nent is  a  random  walk,  Ai{z)  =  1,  and  the  transitory  component  is  white  noise,  Ao[z)  =  1. 
Further,  assume  the  innovations  ei  and  cq  have  unit  variances.  Substituting  into  (5.1), 
these  assumptions  imply  agents  in  the  economy  observe  the  bivariate  income-consumption 
model: 

{AC{i)j-{l      {I  -  P)  j  [eo{i)  J  ■  ^""-i 

When  an  econometrician  studies  the  history  of  observations  on  labor  income  and  con- 
sumption, the  most  information  she  can  recover  is  their  true  Wold  representation — the 
projection  of  {AY.  AC)  on  its  lagged  values.*^  It  is  not  hard  to  show,  given  (5.2),  that 
the  unique  Wold  representation  for  (Ay,  AC),  with  a  pairwise  orthogonal  unit  variance 
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innovation  vector  is:  ;  i 

where  A(/3)^  =  1  +  (1  —  /3)^.  The  disturbances  771  and  772  are  pairwise  orthogonal  white 
noise  having  unit  variance.'^  While  the  determinant  of  the  matrix  moving  average  in  the 
agents'  model  (5.2)  is  z  —  /3,  and  vanishes  at  /?  <  1,  that  in  the  econometrician's  model 
(5.3)  is  A(/3)(l  —  f3z),  which  vanishes  nowhere  on  the  unit  disk.  The  econometrician's 
representation  is  therefore  fundamentcd. 

The  cross  equation  restrictions  in  (5.1)  completely  describe  the  predicted  response 
of  consumption  to  "news"  in  labor  income.  A  disturbance  to  labor  income,  whose  first 
difference  has  Wold  lag  distribution  (1  —  z)Aj(z)  should  lead  to  a  consumption  response 
of  (1  —  l3)Aj(/3).  Consider  the  econometrician's  representation  (5.3).  In  response  to  an 
771  disturbance,  consumption  responds  by  A(^),  which  is  exactly  A(/?)~^(2  —  /?)(!  —  tEtb^) 
evaluated  at  z  =  ^.  Thus,  the  econometrician  will  infer  that  consumption  appears  to 
respond  appropriately  to  the  disturbance  771.  Next,  consider  an  772  disturbance.  The 
econometrician  reasons  that  consumption  should  respond  to  772  by  A(/3)~-^(l  —  /3')  >  0; 
however,  in  the  data,  consumption  does  not  at  all  react  to  772.  In  other  words,  consumption 
will  appear  to  be  "excessivelj'  smooth,"  even  though  the  joint  income-consumption  process 
satisfies  the  PIH. 

This  example  shows  why  an  econometrician,  studying  the  past  history  of  observed 
model  variables,  might  not  draw  correct  inference  on  the  dynamic  efi"ects  of  different 
disturbances.^'  It  is  not  the  case  that  agents  observe  future  events  that  the  econome- 
trician need  only  wait  to  similarly  observe.    Agents  in  the  model  condition  their  actions 
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only  on  observations  on  the  past  history  of  permanent  and  transitory  disturbances;  no 
future  information  is  involved. 

From  (5.1),  our  theory  clearly  implies  restrictions  across  the  equations  for  income  and 
consumption.  In  principle,  the  model  (5.1)  could  be  estimated  and  tested,  for  example, 
by  majcimizing  the  Whittle  frequency  domain  likehhood.  The  results  of  that  exercise 
are  known  in  advance  though.  Aggregate  consumption  is  not  a  martingcUe,  and  so  the 
PIH  would  be  rejected:  see,  for  exaniple,  Christiano,  Eichenbaum  and  Marshall  [1987], 
Heaton  [19S8],  or  Nelson  [1987].  But  evidently  this  would  be  for  reasons  other  than  "excess 
smoothness." 

6.  Conclusion 

Deaton  [1987]  noted  that  (1)  the  permanent  income  hypothesis,  (2)  a  unit  root  for  the  labor 
income  process,  and  (3)  the  estimated  univariate  dynamics  of  US  aggregate  labor  income, 
together  imply  volatile  consumption.  Such  volatihty  is  not  seen  in  the  time  scries  data 
on  aggregate  consumption.  However,  if  labor  income  is  modelled  as  trend-stationary,  then 
the  permanent  income  hypothesis  imphes  consumption  volatihty  roughly  in  hne  with  that 
in  the  data.  Beaton's  finding  makes  a  strong  case  that  the  univariate  dynamic  properties 
of  labor  income — whether  labor  income  is  difference-stationary  or  trend-stationar}-,  its 
long  run  persistence,  its  univariate  short  run  dynamics — are  relevant  for  evaluating  an 
important  economic  hypothesis.  More  genercdly.  it  argues  for  the  importance  of  measures 
of  persistence,  as,  for  instance,  articulately  proposed  in  Campbell  and  Mankiw  [1987]. 

This  paper  has  shown  why  the  reasoning  above  is  misleading.  By  making  the  plausible 
assumption  that  agents  observe  different  kinds  of  disturbances  to  their  labor  income  stream, 
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the  volatility  predictions  of  the  permanent  income  hypothesis  can  be  brought  back  firmly 
in  line  with  the  data.    This  can  be  done  regardless  of  the  pi^ecise  form  of  the  univariate 
dynamics  in  labor  income.  Quite  generally  therefore,  this  paper  argues  that  the  uiiivariate 
characterizations  of  aggregate  time  series  are  simply  not  informative  for  economic  theor}'. 

The  idea  that  there  are  permanent  and  transitory  disturbances  in  tin^e  series  is  an  old 
one,  going  back  at  least  to  Milton  Friedman.  This  assumption  raises  interesting  testable 
hypotheses  in  many  areas  of  empirical  time  series  research.  That  agents  "see"  things 
unobscrvable  to  the  econometrician  has  already  shed  many  useful  insights,  such  a§  the 
notion  of  human  capital  in  labor  economics  and  growth.  In  the  current  paper,  it  has 
served  to  explcdn  a  puzzle,  where  consumption  smoothness  seemed  to  be  inconsistent  with 
a  unit  roots  representation  for  labor  income  dynamics. 

It  is  important  to  emphasize  what  the  paper  does  not  do.  First,  the  unit  roots  hypoth- 
esis has  been  critical  in  re-orienting  econometric  inference  and  modelling.  It  has  provided 
rich  insights  for  reinterpreting  evidence  on  many  interesting  economic  propositions.'^  This 
paper  does  not,  at  all,  argue  against  this.  The  results  in  this  paper  do,  however,  lead 
one  to  be  extremely  skeptical  of  conclusions  such  as  in  Deaton  [19S7]  and  Campbell  and 
Mankiw  [1987].  The  focus  in  those  papers  on  "large"  versus  "small"  unit  roots — an  unfor- 
tunate terminology  introduced  in  Cochrane  [1988] — and  the  idea  that,  somehow,  this  has 
something  to  do  with  interesting  economic  hypotheses  appears  unjustified.  It  would  be 
interesting  to  display  an  expHcit  economic  model  where  this  faith  is,  in  fact,  well-placed. 

Second,  the  paper  does  not  say  that  the  permanent  income  hypothesis  accurately 
describes  the  aggregate  time  series.  The  martingale  predictions  for  consumption,  original!)'' 
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developed  in  Hall  [1978],  are  now  well-known  to  be  false  in  the  data.   Allowing  agents  to 
observe  permanent  and  transitory  disturbances  separately  do.es  not  alter  this  conclusion. 
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Footnotes 

*  Department  of  Economics,  MIT  and  NBER.  I  am  gratefut  to  Ricardo  Caballero,  John 
Campbell,  Larry  Cliristiano,  Angus  Deaton,  Marjorie  Flavin,  Kenneth  West  and  David 
Wilcox  for  discussions  and  correspondence.  An  anonymous  referee  of  this  Journal  and 
seminar  participants  at  Columbia,  Berkeley,  MIT,  the  NBER  Summer  Institute,  Stanford 
and  UC  San  Diego  also  provided  helpful  comments.  I  thank  the  MIT  Statistics  Center  for 
its  hospitality.  AH  errors  and  misinterpretations  are  mine. 

^  It  is,  of  course.  Nelson  and  Plosser  [1982]  who  have  forcefully  drawn  macroeconomists' 
attention  to  the  fact  that  many  aggregate  time  series  may  be  difference  stationary  rather 
than  trend  stationary.  In  Nelson  and  Plosser's  terminology,  a  difference  stationasy  series 
has  first  difference  that  is  covariance  stationary,  although  the  series  itself  is  not;  a  trend 
stationary  series  is  covariance  stationary  about  a  deterministic  time  trend.  In  econometric 
terminology  therefore,  a  difference  stationary  series  is  integrated  of  order  1,  or  simply 
integrated,  when  the  order  can  be  omitted  without  ambiguity. 

^  This  characterization  is  exphcitly  derived  from  an  optimizing  equihbrium  model  below. 
Thus,  it  should  be  distinguished  from  that  such  as  in  Stock  [19SS],  where  consumers  ig- 
nore transitory  income  altogether.  Further,  the  coefficient  on  permanent  income  in  the 
consumption  equation  in  that  work  is  treated  as  a  free  parameter.  By  contrast,  in  the 
kind  of  models  considered  here,  that  coefficient  is  intimately  related  to  labor  income  dy- 
namics. Stock  [19SS]  expertly  apphes  recent  developments  in  the  theory  of  regression  with 
cointegrated  variables  to  re-estabhsh  Friedmaii's  assertions  about  errors-in-variables  bias. 
Deaton's  paradox  does  not  arise  in  those  kinds  of  models.  The  questions  of  interest  there 
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are  different  from  those  considered  here.        ' 

^  Recent  papers  that  have  provided  arguments  similar  in  spirit  to  that  here  are  Blanchard 
and  Quail  [1989]  and  Christiano  and  Eichenbaum  [1989].  The  first  does  so  in  a  Kej^ne- 
sian  model  with  sticky  nominal  wages,  while  the  second  makes  the  point  in  discussing 
productivity  disturbances  in  a  real  business  cycle  model. 

■*  Note  that  we  are  not  suggesting  that  this  permanent- transitory  decomposition  will  ex- 
plain the  other  anomalies  in  the  predictions  of  the  permanent  income  model.  For  instance, 
it  is  now  well-known  that  the  martingale  imphcation  for  consumption  is  simply  false  in 
aggregate  time  series  data.  See  among  others  Caballero  [1988a,  b],  Christiano,  Eichenbaum 
and  Marshall  [1987],  Heaton  [1988]  and  Nelson  [1987]. 

^  After  I  had  completed  a  first  draft  of  this  paper,  Lars  Hansen,  John  Heaton,  and  Thomas 
Sargent  pointed  out  to  me  that  Hansen,  Roberds  and  Sargent  [1989]  contains  analogous 
results. 

^  See  the  quote  in  the  introduction  of  this  paper.  Their  models  however  also  imply  con- 
sumption is  not  a  martingale. 

^  We  assume  throughout  that  expectations  coincide  with  linear  projections. 

Loosely  speaking,  the  annihilation  operator  modifies  its  operand  by  removing  the  part 
in  strictly  negative  powers  of  L  in  the  operand's  Laurent  series  expansion.  See  Hansen  and 
Sargent  [1980]. 

Here  we  need  to  caJculate  the  expected  present  discounted  value  of  a  process  Y  that 
is  not  stationary.  Notice  that  the  resulting  expression  contjiins  a  singularity  on  the  unit 
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circle.  However,  the  present  discounted  value  turns  out,  nevertheless,  to  be  well- defined, 
by  the  reasoning  surrounding  equation  (A3)  of  Appendix  A  in  Hansen  and  Sargent  [1980]. 

^°  See  Deaton  [1987],  and  among  others,  Campbell  and  Deaton  [1989]  and  West  [1988]. 
This  result  is  remarkably  robust  across  alternative  specifications  for  A\  see  Diebold  and 
Rudebusch  [1989]  and  West  [1988]. 

^^  This  unit  root  characterization  will  be  maintained  in  the  subsequent  analysis  since 
"excess  smoothness"  arises  only  in  this  case. 

"^"^  Quah  [1989]  shows  how  to  construct  such  a  decomposition  where  one  component  is 
integrated,  the  other  is  stationary,  and  the  innovations  in  the  two  components  are  uncor- 
related  at  all  leads  and  lags.  That  paper  also  proves  such  an  orthogonal  decomposition 
can  always  be  found.  Tliis  is  unHke  the  orthogonaJ  decomposition  in  Watson  [1986]  which, 
under  some  circumstances,  may  not  exist.  It  is,  however,  a  maintained  assumption  that  in 
the  economy  there  are  different  structural  disturbances,  not  perfectly  correlated,  that  have 
permanent  and  transitory  effects  on  labor  income.  Finally,  note  that  in  the  current  con- 
text, the  Beveridge-Nelson  decomposition  is  not  an  interesting  one  to  consider:  When  the 
two  components  are  perfectly  correlated,  forecasts  of  future  labor  income  are  invariant  to 
whether  we  use  the  Beveridge-Nelson  decomposition  or  the  univariate  Wold  representation. 

^^  From  the  earher  footnote,  this  orthogonaHty  assumption  is  without  loss,  as  the  rep- 
resentation is  to  be  used  only  in  foreccisting  future  labor  income.  Since  the  structural 
disturbances  to  labor  income  may  be  correlated,  Y\  and  Kq  may  not  be  directly  inter- 
pretable.  A  moment's  reflection  shows  that  this  does  not  affect  forecasts  of  future  Y ^  and 
therefore  does  not  change  our  predictions  for  consumption  behavior. 
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^^  Larry  Christiano  suggested  this  terminology. 

» 

^^  Further,  this  long  run  invariance  can  be  shown  to  hold  even  when  the  permanent  and 
transitory  components  are  correlated.  See  Cochrane  [19SS]  for  the  case  when  the  pern:ianent 
component  is  restricted  to  be  a  random  walk,  and  Quah  [1989]  for  the  general  case. 

^®  These  numbers  are  for  the  BUnder-Deaton  [1985]  data,  which  is  that  typically  used  in 
studies  on  consumption  volatihty.  It  is  evident  that  alternative  "reasonable"  v;ilues  of  r  do 
not  fundamentally  narrow  this  difference  between  predicted  and  actual  variances.  Properly 
accounting  for  the  sampHng  properties  of  these  estimates  also  does  not  alter  the  conclusion 
that  consumption  appears  too  smooth  compared  to  the  predictions  of  the  model;  see  West 
[1988]. 

^^  Technically,  a  spectral  density  vanishing  at  frequency  zero  does  not  imply  the  associated 
stochastic  process  is  the  first  difference  of  another  that  is  covariance  stationary.  Quah 
[1989]  provides  regularity  conditions  for  this  implication  to  hold.  It  is  easy  to  verify  these 
conditions  are  satisfied  here. 

^^  We:  take  the  polynomial  Aid{z)  here  to  be  (1  —  0.8z)(l  —  O.SSz).  This  fixes  the  dominant 
root  in  the  AR  part  of  Aq  at  0.S5.  If  this  isn't  done,  it  might  seem  Hke  the  procedure 
simply  trades  off  a  dechning  importance  in  the  permanent  component  for  the  transitory 
component  approaching  non-stationarity.  The  exact  choice,  however,  is  arbitrary  otherwise. 

^^  This  will  then  satisfy  condition  a.  The  moving  average  form  here  is  also  known  to 
minimize  the  innovation  variance  of  a  process  that  has  spectral  density  fixed  at  frequency 
zero;  see  Quah  [1989].  However,  this  second  fact  is  not  directly  useful  here. 
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^°  The  lag  distribution  Aq  is,  in  fact,  simply  the  series  expansion  of  a  rational  function. 
The  denominator  and  numerator  parts  can  therefore  be  obtained  separately  by  a  standard 
algorithm,  such  as  that  in  Wilson  [19G9]. 

^^  This  is  the  value  that  Christiano  [1987]  uses.  West  [19S8],  on  the  other  hand,  uses  r  =  0.5 
percent.  The  results  do  not  much  depend  on  which  value  exactly  is  assumed,  as  long  as  t 
is  strictly  positive. 

^^  Although  not  presented  here,  I  have  verified  that  the  zeroes  of  the  autoregrcssive  and 
moving  average  parts  in  the  transitory  component  are  strictly  outside  the  unit  circle.  This 
property  is  guaranteed  by  the  algorithm  used  to  factor  the  spectral  density  (Wilson  [19G9]). 
The  condition  on  the  moving  average  zeroes  guarantees  that  the  representations  for  both 
Al^i  and  Yq  are  fundamental,  which  is  necessary  for  applying  the  Hansen- Sargent  formula. 

^^  Campbell  and  Deaton  [1989]  and  West  [1988]  use  more  information  than  this;  therefore 
the  statements  below  do  not  apply  to  their  work. 

^■*  This  nonfundamentalness  is  a  property  of  the  Hilbert  spaces  spanned  by  the  history  of 
the  e's  and  that  of  the  observed  sequences  Ay  and  AC  A  httle  reflection  therefore  shows 
that  it  is  invariant  to  whether  or  not  the  structural  shocks  to  the  economy  are  correlated, 
or  equivalently,  whether  or  not  e's  are  the  structural  innovations. 

'^  We  can,  without  loss,  restrict  analysis  to  the  second  moment  properties  of  the  data. 

^^  We  obtain  tliis  by  Rozanov's  [1967]  discussion  in  Chapter  1,  Section  10,  pp.  43-50.  It  is 
straightforward  to  verify  the  matrix  covariograms  imphed  by  the  right  hand  sides  of  (5.2) 
and  (5.3)  are  identical. 
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'^  This  does  not  contradict  Hansen  and  Sargent's  [1981]  results.  The  PIH  restrictions  on 
consumption  and  income  are  not  of  the  form  in  their  theorenj. 

^^  For  examples,  see  the  excellent  paper  by  Stock  and  Watson  [1988]  and  references  therein. 
An  opposing  view  is  presented  in  Sims  [1988]. 
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FIGURE  1 


A  Spectral  Density  Cleaving 
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FIGURE  2 


Labor  Income  Response  to  an  Innovation  in  the  Stationary  Component 
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